Learning with options: Just deliberate and relax

نویسنده

  • Pierre-Luc Bacon
چکیده

Bounded rationality is a very important framework for understanding rationality in both natural and artificial systems. In this paper, we aim to bridge the gap between bounded rationality and reinforcement learning, which has also proven very fruitful in both types of intelligent systems. A lot of reinforcement learning work has focused on Markov Decision Processes, where optimal policies can be obtained under certain assumptions. However, optimality does not take into account possible resource limitations of the agent, which is assumed to have access to a lot of data and computation time. One approach for reducing the costs of such agents has been to provide them with temporally extended actions and models, which allow policies to be computed faster, cf. [Dietterich, 2000; Precup, 2000]. However, the problem of automatically finding good temporal abstractions has proven very difficult. Part of this difficulty stems from the fact that from the point of view of absolute optimality, temporal abstractions are not necessary: the optimal policy is achieved by primitive actions. Therefore, it has been difficult to formalize in what precise theoretical sense temporally abstract actions are helpful.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing reading tasks to maximise vocabulary learning

Most vocabulary learning should occur incidentally through listening and reading. This is one of the  reasons  why  a  substantial  extensive  reading  program  is  an  important  part  of  an  English course.  Extensive  reading  requires  the  learners  to  do  large quantities  of  reading  using  material that  is  at  the  right  level  for  them.  Vocabulary  learning  occurs  through  th...

متن کامل

Learning Martingale Measures to Price Options

We provide a framework for learning risk-neutral measures (Martingale measures) for pricing options. In a simple geometric Brownian motion model, the price volatility, fixed interest rate and a no-arbitrage condition suffice to determine a unique risk-neutral measure. On the other hand, in our framework, we relax some of these assumptions to obtain a class of allowable risk-neutral measures. We...

متن کامل

Factors Affecting Deliberate Learning in First Year Students of Nursing and Midwifery School of Isfahan University of Medical Sciences

Introduction: Medical related professions require continuous deliberated learning in order to promote medical services. It is optimal that students start their professional education with adequate understanding and ability for deliberated learning. This study was conducted to assess the validity and reliability of the questionnaire designed for assessing deliberated learning. Methods: This is ...

متن کامل

Learning Martingale Measures From High Frequency Financial Data to Help Option Pricing

We provide a framework for learning risk-neutral measures (Martingale measures) for pricing options from high frequency financial data. In a simple geometric Brownian motion model, a price volatility, a fixed interest rate and a no-arbitrage condition suffice to determine a unique risk-neutral measure. On the other hand, in our framework, we relax some of these assumptions to obtain a class of ...

متن کامل

Comment on “Simulation-Based Mastery Learning with Deliberate Practice Improves Clinical Performance in Spinal Anesthesia”

Udani et al. have presented the results of a straightforward study on the effect of simulation-basedmastery learning with deliberate practice on clinical performance in spinal anesthesia [1]. And the results of the study are equally straightforward: simulation-based mastery learning with deliberate practice appears to improve the skills of learners. However, there are a number of reasons to be ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015